Khmer | |
---|---|
ភាសាខ្មែរ | |
Pronunciation | IPA: [pʰiːəsaː kʰmaːe] |
Spoken in | Cambodia, Vietnam, Thailand, USA, France, Australia |
Ethnicity | Khmer |
Native speakers | 15 million (2006) 1 million L2 speakers |
Language family | Austro-Asiatic |
Writing system | Khmer script (abugida) |
Official status | |
Official language in | Cambodia |
Regulated by | No official regulation |
Language codes | |
ISO 639-1 | km |
ISO 639-2 | khm |
ISO 639-3 | either: khm – Central Khmer kxm – Northern Khmer |
Khmer (ភាសាខ្មែរ, IPA: [pʰiːəsaː kʰmaːe]; or more formally, ខេមរភាសា, IPA: [kʰeɛmaʔraʔ pʰiːəsaː]), or Cambodian, is the language of the Khmer people and the official language of Cambodia. It is the second most widely spoken Austroasiatic language (after Vietnamese), with speakers in the tens of millions. Khmer has been considerably influenced by Sanskrit and Pali, especially in the royal and religious registers, through the vehicles of Hinduism and Buddhism. It is also the earliest recorded and earliest written language of the Mon–Khmer family, predating Mon and by a significant margin Vietnamese.[1] The Khmer language has influenced, and also been influenced by, Thai, Lao, Vietnamese and Cham, all of which, due to geographical proximity and long-term cultural contact, form a sprachbund in peninsular Southeast Asia.[2]
The Khmer language is written with an abugida known in Khmer as âksâr khmêr. Khmer differs from neighboring languages such as Thai, Lao and Vietnamese in that it is not a tonal language.
The main dialects, all mutually intelligible, are:
Linguistic study of the Khmer language divides its history into four periods, one of which, the Old Khmer period, is subdivided into pre-Angkorian and Angkorian.[5] Pre-Angkorian Khmer, the language after its divergence from Proto-Mon–Khmer until the 9th century, is known only from words and phrases in Sanskrit texts of the era. Old Khmer (or Angkorian Khmer) is the language as it was spoken in the Khmer Empire from the 9th century until the weakening of the empire sometime in the 13th century. Old Khmer is attested by many primary sources and has been studied in depth by a few scholars, most notably Saveros Pou, Phillip Jenner and Heinz-Jürgen Pinnow. Following the end of the Khmer Empire, the language lost the standardizing influence of being the language of government and accordingly underwent a turbulent period of change in morphology, phonology and lexicon. The language of this transition period, from about the 14th to the 18th centuries, is referred to as Middle Khmer and saw borrowing from Thai, Lao and, to a lesser extent, Vietnamese. The changes during this period are so profound that the rules of Modern Khmer cannot be applied to correctly understand Old Khmer. By the 19th century the language had become recognizable as Modern Khmer, the form spoken to the present day.[5]
The following table shows the conventionally accepted historical stages of Khmer (Sidwell 2009:107).[6]
Historical stage | Date |
---|---|
Pre- or Proto-Khmer | Before 600 CE |
Pre-Angkorian Old Khmer | 600–800 CE |
Angkorian Old Khmer | 800 to mid-1300s |
Middle Khmer | Mid-1300s to 1700s |
Modern Khmer | 1800–present |
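The stage boundaries in the table can be read as a simple lookup from a year to a stage name. The following is a minimal sketch of that reading, not an established tool: the function name `khmer_stage` is hypothetical, and the year 1350 is an assumed stand-in for "mid-1300s", which the table does not pin down. The table also leaves the 1700s-to-1800 boundary loose; the sketch simply switches to Modern Khmer at 1800.

```python
from bisect import bisect_right

# First year (CE) of each stage after the earliest one, taken from the table.
# ASSUMPTION: 1350 stands in for "mid-1300s"; the source gives no exact year.
STAGE_STARTS = [600, 800, 1350, 1800]

STAGE_NAMES = [
    "Pre- or Proto-Khmer",      # before 600 CE
    "Pre-Angkorian Old Khmer",  # 600-800 CE
    "Angkorian Old Khmer",      # 800 to mid-1300s
    "Middle Khmer",             # mid-1300s to 1700s
    "Modern Khmer",             # 1800 to present
]


def khmer_stage(year: int) -> str:
    """Return the conventional stage name for a year CE (hypothetical helper)."""
    # bisect_right counts how many stage boundaries lie at or before the year,
    # which is exactly the index of the stage the year falls into.
    return STAGE_NAMES[bisect_right(STAGE_STARTS, year)]


if __name__ == "__main__":
    for y in (500, 700, 1200, 1600, 1950):
        print(y, khmer_stage(y))
```

A boundary year is assigned to the later stage here (600 counts as Pre-Angkorian Old Khmer); that is a design choice of the sketch, not a claim from the cited source.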
Khmer is classified as a member of the Eastern branch of the Mon–Khmer language family, itself a subdivision of the larger Austroasiatic language group, which has representatives in a large swath of land from Northeast India down through Southeast Asia to the Malay Peninsula and its islands. As such, its closest relatives are the languages of the Pearic, Bahnaric, and Katuic families spoken by the hill tribes of the region.[7] The Vietic languages have also been classified as belonging to this family.
The phonological system described here is the inventory of sounds of the spoken language, not how they are written in the Khmer alphabet.[8]
Most Cambodian dialects are not tonal. However, the colloquial Phnom Penh dialect has developed a marginal tonal contrast (a level versus a peaking tone) to compensate for the elision of /r/.[9]
Khmer once had a phonation distinction in its vowels, which was indicated in writing by choosing between two sets of letters for the preceding consonant according to the historical source of the phonation. However, phonation has been lost in all but the most archaic dialect of Khmer (Western Khmer).[10] For example, Old Khmer distinguished voiced and unvoiced pairs as in *kaa vs *ɡaa. The vowels after voiced consonants became breathy voiced and diphthongized: *kaa, *ɡe̤a. When consonant voicing was lost, the distinction was maintained by the vowel: *kaa, *ke̤a, and later the phonation disappeared as well: [kaː], [kiə].[9]
| | Labial | Dental[11]/Alveolar | Palatal | Velar | Glottal |
---|---|---|---|---|---|
Plosive | p (pʰ) | t (tʰ) | c (cʰ) | k (kʰ) | ʔ |
Implosive | ɓ ~ b | ɗ ~ d | | | |
Nasal | m | n | ɲ | ŋ | |
Liquid | | r l | | | |
Fricative | | s | | | h |
Approximant | ʋ | | j | | |
Khmer is frequently described as having aspirated stops. However, these may be analyzed as consonant clusters, /ph, th, ch, kh/, as infixes can occur between the stop and the aspiration (phem, p<an>hem), or as non-distinctive phonetic detail in other consonant clusters, such as the khm in Khmer.[9][12] [b] and [d] are occasional allophones of the implosives.
In addition, the consonants /f/, /ʃ/, /z/ and /ɡ/ may occasionally occur in recent loan words in the speech of Cambodians familiar with French and other languages. These non-native sounds are not represented in the Khmer script, although combinations of letters otherwise unpronounceable are used to represent them when necessary. In the speech of those who are not bilingual, these sounds are approximated with natively occurring phonemes:
Foreign Sound (IPA) | Khmer Representation | Khmer Approximation (IPA) |
---|---|---|
/ɡ/ | ហ្គ | /k/ |
/ʃ/ | ហ្ស | /s/ |
/f/ | ហ្វ | /h/ or /pʰ/ |
/z/ | ហ្ស | /s/ |
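The approximation column of the table amounts to a substitution over phoneme segments. The sketch below assumes a word is already split into a list of IPA segments; the names `APPROXIMATIONS` and `nativize` are hypothetical, and the segment lists in the usage lines are made-up illustrations rather than attested loanwords.

```python
# Foreign consonant -> native approximation(s), copied from the table above.
# Where the table lists two options (/f/), both are kept, in table order.
APPROXIMATIONS = {
    "ɡ": ["k"],
    "ʃ": ["s"],
    "f": ["h", "pʰ"],
    "z": ["s"],
}


def nativize(segments, option=0):
    """Replace non-native consonants with a native approximation.

    `segments` is a list of IPA segments. `option` selects which listed
    approximation to use when the table offers more than one; it falls back
    to the last listed option if the index is out of range.
    """
    result = []
    for seg in segments:
        choices = APPROXIMATIONS.get(seg)
        if choices is None:
            result.append(seg)  # native segment: keep unchanged
        else:
            result.append(choices[min(option, len(choices) - 1)])
    return result


if __name__ == "__main__":
    # Illustrative segment lists only, not real Khmer loanwords.
    print(nativize(["f", "aː", "z"]))            # first option for /f/
    print(nativize(["f", "aː", "z"], option=1))  # second option for /f/
```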
There is little agreement as to the vowels of Khmer. This may be in part because political centralization has not yet been achieved, so standard Khmer does not prevail throughout Cambodia. As such, many speakers of even the same community may have different phonological inventories.[13] Two proposals follow:
Long vowels | iː | eː | ɛː | ɨː | əː | aː | uː | oː | ɔː | |
---|---|---|---|---|---|---|---|---|---|---|
Short vowels | i | e | ɨ | ə | ɐ | a | u | o | ||
Long diphthongs | iə̯ | ei̯ | ɐe̯ | ɨə̯ | əɨ̯ | ɐə̯ | ao̯ | uə̯ | ou̯ | ɔə̯ |
Short diphthongs | eə̯̆ | uə̯̆ | oə̯̆ |
Long vowels | iː | e̝ː | eː | ɛː | ɯː | ə̝ː | əː | aː | uː | o̝ː | oː | ɔː |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Short vowels | i | e | ɛ | ɯ | ə | a | u | o | ɔ | |||
Long diphthongs | iə̯ | aɛ̯ | aə̯ | o̞u̯ | ao̯ | |||||||
Short diphthongs | ɛə̯ | ʷɔ |
The precise number and the phonetic value of vowel nuclei vary from dialect to dialect. Short and long vowels of equal quality are distinguished solely by duration.
Khmer words are predominantly either monosyllabic or sesquisyllabic, with stress falling on the final syllable.[15] There are 85 possible clusters of two consonants at the beginning of syllables and two three-consonant clusters with phonetic alterations as shown below:
| | p | ɓ | t | ɗ | c | k | ʔ | m | n | ɲ | ŋ | j | l | r | s | h | ʋ | tʰ | kʰ |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
p | | | pʰt- | pɗ- | pʰc- | pʰk- | pʔ- | | pʰn- | pʰɲ- | pʰŋ- | pʰj- | pʰl- | pr- | ps- | pʰ- | | | |
t | tʰp- | tɓ- | | | | tʰk- | tʔ- | tʰm- | tʰn- | | tʰŋ- | tʰj- | tʰl- | tr- | | tʰ- | tʰʋ- | | |
c | cʰp- | cɓ- | | | | cʰk- | cʔ- | cʰm- | cʰn- | | cʰŋ- | | cʰl- | cr- | | cʰ- | cʰʋ- | | |
k | kʰp- | kɓ- | kʰt- | kɗ- | kʰc- | | kʔ- | kʰm- | kʰn- | kʰɲ- | kŋ- | kʰj- | kʰl- | kr- | ks- | kʰ- | kʰʋ- | | |
s | sp- | sɓ- | st- | sɗ- | | sk- | sʔ- | sm- | sn- | sɲ- | sŋ- | | sl- | sr- | | | sʋ- | stʰ- | |
ʔ | | | | | | | | | | | | | | | | | ʔʋ- | | |
m | | | mt- | mɗ- | mc- | | mʔ- | | mn- | mɲ- | | | ml- | mr- | ms- | mh- | | | |
l | lp- | lɓ- | | | | lk- | lʔ- | lm- | | | lŋ- | | | | | lh- | lʋ- | | lkʰ- |
Syllables begin with one of these consonants or consonant clusters, followed by one of the vowel nuclei. The aspiration in some clusters is allophonic.[12] When the vowel nucleus is short, a final consonant is required. The consonants /p, t, c, k, ʔ, m, n, ɲ, ŋ, l, h, j, ʋ/ can occur in the syllable coda, where /h/ and /ʋ/ are realized as [ç] and [w] respectively. The stops /p, t, c, k/ are unreleased in syllable-final position.
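The coda constraints in the preceding paragraph can be written down as two small checks. This is an illustrative sketch under stated assumptions: the function names `is_valid_rhyme` and `realize_coda` are hypothetical, a syllable is assumed to be pre-analysed into a vowel length flag and an optional coda consonant, and the unreleased stops are shown with the IPA "no audible release" diacritic (U+031A), a notation the text itself does not use.

```python
from typing import Optional

# Consonants the text lists as possible in the syllable coda.
ALLOWED_CODAS = {"p", "t", "c", "k", "ʔ", "m", "n", "ɲ", "ŋ", "l", "h", "j", "ʋ"}

# Coda realizations named in the text: /h/ -> [ç], /ʋ/ -> [w].
CODA_ALLOPHONES = {"h": "ç", "ʋ": "w"}

# Stops described as unreleased when syllable-final.
UNRELEASED_STOPS = {"p", "t", "c", "k"}


def is_valid_rhyme(vowel_is_short: bool, coda: Optional[str]) -> bool:
    """Check the two coda rules: short nuclei need a coda, and any coda
    must come from the permitted set."""
    if coda is None:
        return not vowel_is_short
    return coda in ALLOWED_CODAS


def realize_coda(coda: str) -> str:
    """Return a phonetic realization of a coda phoneme."""
    if coda not in ALLOWED_CODAS:
        raise ValueError(f"/{coda}/ is not a permitted coda")
    if coda in UNRELEASED_STOPS:
        # ASSUMPTION: U+031A is used here to mark the unreleased stop.
        return coda + "\u031a"
    return CODA_ALLOPHONES.get(coda, coda)


if __name__ == "__main__":
    print(is_valid_rhyme(True, None))   # short vowel without a coda
    print(is_valid_rhyme(False, None))  # long vowel without a coda
    print(realize_coda("h"), realize_coda("ʋ"), realize_coda("k"))
```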
The most common word structure in Khmer is a full syllable as described above, which may be preceded by an unstressed, “minor” syllable that has a consonant-vowel structure of CV-, CrV-, CVN- or CrVN- (N is any nasal in the Khmer inventory). The vowel in these preceding syllables is usually reduced in conversation to [ə]; in careful or formal speech, and on TV and radio, it is clearly articulated.
Words with three or more syllables exist, particularly those pertaining to science, the arts, and religion. However, these words are loanwords, usually derived from Pali, Sanskrit, or more recently, French.
Khmer is generally a subject–verb–object (SVO) language with prepositions.[16] Although Khmer is primarily an isolating language, lexical derivation by means of prefixes and infixes is common, though not always productive in the modern language.[17]
Adjectives, demonstratives and numerals follow the noun they modify. Adverbs likewise follow the verb. Morphologically, adjectives and adverbs are not distinguished, and many words serve either function. As in other languages of the region, intensity can be expressed by reduplication.
ស្រីស្អាតនោះ /srəj sʔaːt nuh/ (girl pretty that) = that pretty girl
ស្រីស្អាតស្អាត /srəj sʔaːt sʔaːt/ (girl pretty pretty) = a very pretty girl
As Khmer sentences rarely use a copula, adjectives are also employed as verbs. Comparatives are formed by the use of /ciəng/: "A X /ciəng/ B" (A is more X than B). The most common way to express the idea of superlatives is the construction "A X /ciəng ke:/" (A is X-est of all).
The noun has no grammatical gender or singular/plural distinction and is uninflected. Technically there are no articles, but indefiniteness is often expressed by the word for "one" following the noun. Plurality can be marked by postnominal particles, numerals, or reduplicating the adjective, which, although similar to intensification, is usually not ambiguous due to context.
ឆ្កែធំ /cʰkae tʰom/ (dog large) = large dog
ឆ្កែធំធំ /cʰkae tʰom tʰom/ (dog large large) = a very large dog or large dogs
ឆ្កែធំណាស់ /cʰkae tʰom nah/ (dog large very) = very large dog
ឆ្កែពីរ /cʰkae piː/ (dog two) = two dogs
Classifying particles for use between numerals and nouns exist, although they are not always obligatory as they are in, for example, Thai. Pronouns are subject to a complicated system of social register, the choice of pronoun depending on the perceived relationships between speaker, audience and referent (see Social registers below). Kinship terms, nicknames and proper names are often used as pronouns (including for the first person) among intimates. Subject pronouns are frequently dropped in colloquial conversation.
As is typical of most East Asian languages,[18] the verb does not inflect at all; tense and aspect can be shown by particles and adverbs or understood from context. Most commonly, time words such as "yesterday", "earlier" and "tomorrow" indicate tense when it cannot be inferred from context. There is no participle form. Ongoing action is expressed with /kəmpɔːŋ/: "A /kəmpɔːŋ/ V" (A is in the process of V). Serial verb constructions are quite common. A verb is negated by putting /min/ before it and /teː/ at the end of the sentence or clause. In normal speech, a verb can also be negated without an ending particle by putting /ʔɐt/ before it.
ខ្ញុំជឿ /kʰɲom cɨə/ – I believe
ខ្ញុំមិនជឿទេ /kʰɲom min cɨə teː/ – I don't believe
ខ្ញុំឥតជឿ /kʰɲom ʔɐt cɨə/ – I don't believe
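The two negation patterns shown in these examples can be sketched as string templates over IPA transcriptions. The function name `negate` and its `colloquial` flag are hypothetical, and the sketch assumes a bare subject-plus-verb clause, so that /teː/ simply goes last; longer clauses would need the particle placed at the end of the whole clause.

```python
def negate(subject: str, verb: str, colloquial: bool = False) -> str:
    """Build a negated subject-verb clause in IPA transcription.

    Standard pattern:   subject + min + verb + teː
    Colloquial pattern: subject + ʔɐt + verb (no final particle)
    """
    if colloquial:
        return f"{subject} ʔɐt {verb}"
    return f"{subject} min {verb} teː"


if __name__ == "__main__":
    print(negate("kʰɲom", "cɨə"))                   # standard negation
    print(negate("kʰɲom", "cɨə", colloquial=True))  # colloquial negation
```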
Dialects are sometimes quite marked. Notable variations are found in speakers from Phnom Penh (the capital), the rural Battambang area, the areas of Northeast Thailand adjacent to Cambodia such as Surin province, the Cardamom Mountains, and southern Vietnam.[19] The dialects form a continuum running roughly north to south. Standard Cambodian Khmer is mutually intelligible with the others, but a Khmer Krom speaker from Vietnam, for instance, may have great difficulty communicating with a Khmer native to Sisaket Province in Thailand.
The following classification of Khmer dialects is from Ferlus (1992),[20] as cited in Sidwell (2009).[6]
Northern Khmer, the dialect spoken in Thailand, is referred to in Khmer as Khmer Surin and, although it only began to diverge from standard Khmer within the last 200 years, is considered by some linguists to be a separate language.[21] This is due to its distinct accent, influenced by the surrounding tonal language Thai; its lexical differences; and its phonemic differences in both vowels and the distribution of consonants. Final /r/, which has become silent in other dialects of Khmer, is pronounced in Northern Khmer.
Western Khmer, also called Cardamom Khmer, is spoken by a small, isolated population in the Cardamom mountain range extending from Cambodia into Thailand. Although little studied, it is unique in that it maintains a definite system of vocal register that has all but disappeared in other dialects of modern Khmer.[10]
A notable characteristic of Phnom Penh casual speech is the merging or complete elision of syllables, which speakers from other regions consider a "relaxed" pronunciation. For instance, "Phnom Penh" is sometimes shortened to "m'Penh". Another characteristic of Phnom Penh speech is observed in words with an "r" either as an initial consonant or as the second member of a consonant cluster (as in the English word "bread"). The "r", trilled or flapped in other dialects, is either pronounced as a uvular trill or not pronounced at all. This alters the quality of any preceding consonant, causing a harder, more emphasized pronunciation. A further result is that the syllable is spoken with a low-rising or "dipping" tone, much like the "hỏi" tone in Vietnamese. For example, some people pronounce /trəj/ (meaning "fish") as /təj/: the "r" is dropped, and the vowel begins much lower in pitch than in standard speech and then rises, effectively doubling its length. Another example is the word /riən/ ("study, learn"), which is pronounced /ʀiən/, with the uvular "r" and the same intonation described above.[22]
Khmer employs a system of registers in which the speaker must always be conscious of the social status of the person spoken to. The different registers, which include those used for common speech, polite speech, speaking to or about royals and speaking to or about monks, employ alternate verbs, names of body parts and pronouns. To foreigners the registers can appear to be separate languages. In fact, isolated villagers are often unsure how to speak with royals, and royals raised completely within the court do not feel comfortable speaking the common register. Another result is that the pronominal system is complex and full of honorific variations.
As an example, the word for "to eat" used between intimates or in reference to animals is /siː/. Used in polite reference to commoners, it's /ɲam/. When used of those of higher social status, it's /pisa/ or /tɔtuəl tiən/. For monks the word is /cʰan/ and for royals, /saoj/.[1]
Khmer is written with the Khmer script, an abugida developed from the Pallava script of India before the 7th century.[23] The Khmer script is similar in appearance and usage to both Thai and Lao, which were based on the Khmer system, and is distantly related to the Burmese script.[23] Khmer numerals, which were inherited from Indian numerals, are used more widely than Hindu-Arabic numerals. The Khmer script is also used within Cambodia to transcribe hill tribe languages that have no writing system.[8]
The numbers[17] are:
Value | Khmer numeral | Khmer word | Romanization | IPA |
---|---|---|---|---|
0 | ០ | សូន្យ | (son) | /soːu̯n/ |
1 | ១ | មួយ | (muŏy) | /muːə̯j/ |
2 | ២ | ពីរ | (pi) | /piː/ |
3 | ៣ | បី | (bei) | /ɓəːj/ |
4 | ៤ | បួន | (buŏn) | /ɓuːə̯n/ |
5 | ៥ | ប្រាំ | (prăm) | /pram/ |
6 | ៦ | ប្រាំមួយ | (prăm muŏy) | /pram muːə̯j/ |
7 | ៧ | ប្រាំពីរ | (prăm pi) | /pram piː/ (also /pram pɨl/) |
8 | ៨ | ប្រាំបី | (prăm bei) | /pram ɓəːj/ |
9 | ៩ | ប្រាំបួន | (prăm buŏn) | /pram ɓuːə̯n/ |
10 | ១០ | ដប់ | (dâp) | /ɗɑp/ |
100 | ១០០ | មួយរយ | (muŏy rôy) | /muːə̯j rɔj/ |
1,000 | ១០០០ | មួយពាន់ | (muŏy poăn) | /muːə̯j pɔə̯n/ |
10,000 | ១០០០០ | មួយម៉ឺន | (muŏy mœn) | /muːə̯j məɨn/ |
100,000 | ១០០០០០ | មួយសែន | (muŏy sên) | /muːə̯j saːe̯n/ |
1,000,000 | ១០០០០០០ | មួយលាន | (muŏy léan) | /muːə̯j liːə̯n/ |
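Two regularities in the table lend themselves to a short sketch: Khmer numerals are a positional decimal system with their own ten digit glyphs, and the words for 6 to 9 are built as "five" plus 1 to 4. The function names `to_khmer_digits` and `spell_0_to_10` are hypothetical, and the spelling helper covers only 0 to 10 because the table does not show how larger compound numbers are formed.

```python
# Khmer digit glyphs for 0-9, in order, as listed in the table.
KHMER_DIGITS = "០១២៣៤៥៦៧៨៩"

# Romanized number words given directly in the table.
BASE_WORDS = {
    0: "son",
    1: "muŏy",
    2: "pi",
    3: "bei",
    4: "buŏn",
    5: "prăm",
    10: "dâp",
}


def to_khmer_digits(n: int) -> str:
    """Write a non-negative integer with Khmer digit glyphs."""
    if n < 0:
        raise ValueError("only non-negative integers are handled")
    # Each decimal digit maps one-to-one onto a Khmer glyph.
    return "".join(KHMER_DIGITS[int(d)] for d in str(n))


def spell_0_to_10(n: int) -> str:
    """Return the romanized word for 0-10, composing 6-9 as 5 + (n - 5)."""
    if n in BASE_WORDS:
        return BASE_WORDS[n]
    if 6 <= n <= 9:
        return f"{BASE_WORDS[5]} {BASE_WORDS[n - 5]}"
    raise ValueError("this sketch only covers 0 through 10")


if __name__ == "__main__":
    for value in (0, 7, 10, 1000):
        print(value, to_khmer_digits(value))
    for value in range(11):
        print(value, spell_0_to_10(value))
```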